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Abstract 

The standard LP relaxation of the asymmetric traveling salesman problem has 
been conjectured to have a constant integrality gap in the metric case. We prove this 
conjecture when restricted to shortest path metrics of node-weighted digraphs. Our 
arguments are constructive and give a constant factor approximation algorithm 
for these metrics. We remark that the considered case is more general than the 
directed analog of the special case of the symmetric traveling salesman problem 
for which there were recent improvements on Christofides' algorithm. 

The main idea of our approach is to first consider an easier problem obtained by 
significantly relaxing the general connectivity requirements into local connectivity 
conditions. For this relaxed problem, it is quite easy to give an algorithm with 
a guarantee of 3 on node-weigh ted shortest path metrics. More surprisingly, we 
then show that any algorithm (irrespective of the metric) for the relaxed problem 
can be turned into an algorithm for the asymmetric traveling salesman problem by 
only losing a small constant factor in the performance guarantee. This leaves open 
the intriguing task of designing a "good" algorithm for the relaxed problem on 
general metrics. 


Keywords: approximation algorithms, asymmetric traveling salesman problem, combinatorial 
optimization, linear programming 



1 Introduction 


The traveling salesman problem is one of the most fundamental combinatorial optimiza¬ 
tion problems. Given a set V of n cities and a distance/weight fimction w : V xV ^ R"*", 
it is the problem of finding a tour of minimum total weight that visits each city exactly 
once. There are two variants of this general definition: the symmetric traveling salesman 
problem (STSP) and the asymmetric traveling salesman problem (ATSP). In the symmet¬ 
ric version we assume w{u, v) = w{v, u) for each pair u,v e V of cities; whereas we make 
no such assumption in the more general asymmetric traveling salesman problem. 

In both versions, it is common to assume the triangle inequality and we shall do so 
in the rest of this paper. Recall that the triangle inequality says that for any triple i, j, k 
of cities, we have w{i, j) -l- w{i,k) > w{i,k). In other words, it is not more expensive to 
take the direct path compared to a path that makes a detour. Another equivalent view 
of the triangle inequality is that, instead of insisting that each city is visited exactly once, 
we should find a tour that visits each city at least once. These assumptions are arguably 
natural in many, if not most, settings. They are also necessary in the following sense: 
any reasonable approximation algorithm (with approximation guarantee 0(exp(n))) for 
the traveling salesman problem without the triangle inequality would imply P = NP 
because it would solve the problem of deciding Hamiltonicity. 

Understanding the approximability of the symmetric and the asymmetric traveling 
salesman problem (where we have the triangle inequality) turns out to be a much 
more interesting and notorious problem. On the one hand, the strongest known 
inapproximability results, by Karpinski, Lampis, and Schmied [14], say that it is 
NP-hard to approximate STSP within a factor of 123/122 and that it is NP-hard to 
approximate ATSP within a factor of 75/74. On the other hand, the current best 
approximation algorithms are far from these guarantees, especially in the case of ATSP. 

For the symmetric traveling salesman problem, Christofides' beautiful algorithm 
from 1976 sill achieves the best known approximation guarantee of 1.5 [7]. However, 
a recent series of papers [11, 16, 17, 19], broke this barrier for the interesting special 
case of shortest path metrics of unweighted undirected graphs^. Specifically, Oveis 
Gharan, Saberi, and Singh [11] first gave an approximation guarantee of 1.5 - e; Momke 
and Svensson [16] proposed a different approach yielding a 1.461-approximation 
guarantee; Mucha [17] gave a tighter analysis of this algorithm; and Sebo and Vygen [19] 
significantly developed the approach to give the current best approximation guarantee 
of 1.4. 

The interest in shortest path metrics has several motivations. It is a natural special 
case that seems to capture the difficulty of the problem: it remains APX-hard and the 
worst known integrality gap for the Held-Karp relaxation is of this type. Moreover, 
it has an attractive graph theoretic formulation: given an unweighted graph, find a 
shortest tour that visits each vertex at least once. This is the (unweighted) "graph" 
analog of STSR Indeed, if allow the graph to be edge-weighted, this formulation 
is equivalent to STSP on general metrics. Let us also mention that the polynomial 
time approximation scheme for the symmetric traveling salesman problem on planar 

^The shortest path metric of a graph G = {V, E) is defined as follows: the weight w{u, v) between cities 
u,v e V equals the shortest path between w and v in G. If the graph is node-weighted / : V —> the 
weight/length of an edge [u, i;) e E is /(«) + f{v). 
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graphs was first obtained for the special case of unweighted graphs [12], i.e., when 
restricted to shortest path metrics of unweighted graphs, and then generalized to the 
case of edge-weights [3]. For STSP, it remains a major open problem whether the ideas 
in [11, 16,17, 19] can be applied to general metrics. We further discuss this in Section 6. 

The gap in our understanding is much larger for the asymmetric traveling salesman 
problem for which it remains a notorious open problem to design an algorithm with any 
constant approximation guarantee. This is a particularly intriguing as the standard linear 
programming relaxation, often referred to as the Held-Karp relaxation, is only known 
to have an integrality gap of at least 2 [6]. There are in general two available approaches 
for designing approximation algorithms for ATSP in the literature. The first approach 
is due to Frieze, Galbiati, and Maffiolo [9] who gave a log 2 (n)-approximation algorithm 
for ATSP already in 1982. Their basic idea is simple and elegant: a minimum weight 
cycle cover has weight at most that of an optimal tour and it will decrease the number 
of connected components by a factor of at least 2. Hence, if we repeat the selection of a 
minimum weight cycle cover log 2 (u) times, we get a connected Eulerian graph which 
(by shortcutting) is a log 2 (n)-approximate tour. Although the above analysis is tight 
only in the case when almost all cycles in the cycle covers have length 2, it is highly 
non-trivial to refine the method to decrease the number of iterations. It was first in 2003 
that Blaser [5] managed to give an approximation guarantee of 0.999 log 2 (ft). This was 
improved shortly thereafter by Kaplan, Lewenstein, Shafrir and Sviridenko [13] who 
further developed this approach to obtain a 4/3 log 3 (n) w 0.841og2(n)-approximation 
algorithm; and later by Feige and Singh [8] who obtained an approximation guarantee 
of 2/31og2(n). 

A second approach was more recently proposed in an influential and beautiful 
paper by Asadpour, Goemans, Madry, Oveis Gharan, and Saberi [4] who gave an 
0(logn/loglogn)-approximation algorithm for ATSP. Their approach is based on 
finding a so-called a-thin sparming tree which is a (unweighted) graph theoretic 
problem. Here, the parameter a is proportional to the approximation guarantee so 
a = 0(log n/ log log n) in [4]. Following their publication, Oveis Gharan and Saberi [10] 
gave an efficient algorithm for finding 0(l)-thin spanning trees for planar and bounded 
genus graphs yielding a constant factor approximation algorithm for ATSP on these 
graph classes. Also, in a very recent major progress, Anari and Oveis Gharan [1] 
showed the existence of 0(polylog log n)-thin sparming trees for general instances. 
This implies a 0(polylog log n) upper bound on the integrality gap of the Held-Karp 
relaxation. Hence, it gives an efficient so-called estimation algorithm for estimating 
the optimal value of a tour within a factor 0(polylog log n) but, as their arguments are 
non-constructive, no approximation algorithm for finding a tour of matching guarantee. 
The result in [1] is based on developing and extending several advanced techniques. 
Notably, they rely on their extension [2] of the recent proof of the Kadison-Singer 
conjecture which was a major breakthrough by Marcus, Spielman, and Srivastava [15]. 

To summarize, the current best approximation algorithm has a guarantee of 
0(log n/ log log n) [4] and the best upper bound on the integrality gap of the Held-Karp 
relaxation is O (polylog log n) [1]. These two bounds are far away from the known 
inapproximability results [14] and from the lower bound of 2 on the integrality gap of 
the Held-Karp relaxation [6]. Moreover, there were no better approximation algorithms 
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known in the case of shortest path metrics of unweighted digraphs for which there was 
recent progress in the undirected setting. In particular, it is not clear how to use the two 
available approaches mentioned above to get an improved approximation guarantee in 
this case: in the cycle cover approach, the main difficulty is to bound the number of 
iterations and, in the thin spanning tree approach, ATSP is reduced to an unweighted 
graph theoretic problem. 

1.1 Our Results and Overview of Approach 

We propose a new approach for approximating the asymmetric traveling salesman 
problem based on relaxing the global connectivity constraints into local connectivity 
conditions. We also use this approach to obtain the following result where we refer to 
ATSP on shortest path metrics of node-weighted digraphs as Node-Weighted ATSP. 

Theorem 1.1. There is a constant approximation algorithm for Node-Weighted ATSP. Specifi¬ 
cally, for Node-Weighted ATSP, the integrality gap of the Held-Karp relaxation is at most 15 
and, for any e > 0, there is a polynomial time algorithm that finds a tour of weight at most 
(27 -I- e) OPThk where OPThk denotes the optimal value of the Held-Karp relaxation. 

As further discussed in Section 6, the constants in the theorem can be slightly 
improved by specializing our general approach to the node-weighted case. However, 
it remains an interesting open problem to give a tight bound on the integrality gap. 

Let us continue with a brief overview of our approach that is not restricted to the 
node-weighted version. It is illustrative to consider the following "naive" algorithm 
that actually was the starting point of this work: 

1. Select a random cycle cover C using the Held-Karp relaxation. 

It is well known that one can sample such a cycle cover C of expected weight 
equal to the optimal value OPThx of the Held-Karp relaxation. 

2. While there exist more than one component, add the lightest cycle (i.e., the cycle 
of smallest weight) that decreases the number of components. 

It is clear that the above algorithm always returns a solution to ATSP: we start with 
a Eulerian graph^ and the graph stays Eulerian during the execution of the while- 
loop which does not terminate until the graph is connected. This gives a tour that 
visits each vertex at least once and hence a solution to ATSP (using that we have 
the triangle-inequality). However, what is the weight of the obtained tour? Pirst, as 
remarked above, we have that the expected weight of the cycle cover is OPThx- So if 
C contains k = \C\ cycles, we would expect that a cycle in C has weight OPThx jk (at 
least on average). Moreover, the number of cycles added in Step 2 is at most k-1 since 
each cycle decreases the number of components by at least one. Thus, if each cycle in 
Step 2 has weight at most the average weight OPT^x jk of a cycle in C, we obtain a 
2-approximate tour of weight at most OPThx + ^ OPThx ^ 2 OPT^x- 

Unfortunately, it seems hard to find a cycle cover C so that we can always cormect it 
with light cycles. Instead, what we can do, is to first select a cycle cover C then add light 

^Recall that a directed graph is Eulerian if the in-degree equals the out-degree of each vertex. 
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cycles that decreases the number of components as long as possible. When there are no 
more light cycles to add, the vertices/cities are partitioned into Vi ,..., Vjt connected 
components. In order to make progress from this point, we would like to find a "light" 
Eulerian set F of edges that crosses the cuts {{Vi, Vi) \ i = 1,2,... ,k}. We could then 
hope to add F to our solution and continue from there. It turns out that it is very 
important what "light" means in this context. For our arguments to work, we need that 
f is selected so that the weight of the edges in each component has weight at most a 
times what the linear programming solution "pays" for the vertices in that component. 
This is the intuition behind the definitions in Section 3 of Local-Connectivity ATSP and 
a-light algorithms for that problem. We also need to be very careful in which way we 
add edges from light cycles and how to use the n-light algorithm for Local-Connectivity 
ATSP. In Section 5, our algorithm will iteratively solve the Local-Connectivity ATSP 
and, in each iteration, it will add a carefully chosen subset of the found edges together 
with light cycles. 

We remark that in Local-Connectivity ATSP we have relaxed the global connectivity 
properties of ATSP into local cormectivity conditions that only say that we need to find 
a Eulerian set of edges that crosses at most n = \V\ cuts defined by a partitioning of the 
vertices. In spite of that, we are able to leverage the intuition above to obtain our main 
technical result: 

Theorem (Simplified statement of Theorem 5.1). The integrality gap of the Held-Karp 
relaxation is at most 5a if there exists an a-light algorithm .Vi for Local-Connectivity ATSP. 
Moreover, for any e > 0, we can find a (9 + e)a-approximate tour in time polynomial in n,lle, 
and in the running time of Vi. 

The proof of the above theorem (Section 5) is based on generalizing and, as alluded to 
above, deviating from the above intuition in several ways. First, we start with a carefully 
chosen "Eulerian partition" which generalizes the role of the cycle cover C in Step 1 
above. Second, both the iterative use of the n-light algorithm for Local-Connectivity 
ATSP and the way we add light cycles are done in a careful and dependent manner so as 
to be able to bound the total weight of the returned solution. Theorem 1.1 follows from 
Theorem 5.1 together with a 3-light algorithm for Node-Weighted Local-Connectivity 
ATSP. The 3-light algorithm, described in Section 4, is a rather simple application of 
classic theory of flows and circulations. We also remark that it is the only part of the 
paper that relies on having shortest path metrics of node-weighted digraphs. 

Our work raises several natural questions. Perhaps the most immediate and 
intriguing question is whether there is a 0(l)-light algorithm for Local-Connectivity 
ATSP on general metrics. We further elaborate on this and other related questions in 
Section 6. 

2 Preliminaries 

2.1 Basic Notation 

Consider a directed graph G = {V,E). For a subset S c y, we let 6'''(S) = {{u,v) e E \ 
u e S,v ^ S}he the outgoing edges and we let 6“(S) = {{u,v) € E | w ^ S,u € S} be the 
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incoming edges of the cut defined by S. When considering a subset E' c E of the edges, 
we denote the restrictions to that subset by 6 p(S) = 6 '''(S) n E' and by 6 p(S) = 6 “(S) n E'. 
We also let C(E') = {Gi = (yi,Ei),G 2 = ( 1 / 2 , £ 2 ), • • •, Q = {%,Ek)} denote the set of 
subgraphs corresponding to the k connected components of the graph {V, E'); the vertex 
set V will always be clear from the context. Here cormected means that the subgraphs 
are connected if we undirect the edges. 

When considering a function / : Lt —> R, we let /(X) = Lxex /(^) for X <z U. For 
example, if G is edge weighted, i.e., there exists a function re : E —> R, then w{E') denotes 
the total weight of the edges in £' c E. Similarly, if G is node-weighted, then there 
exists a fimction / : F —> R and /(S) denotes the total weight of the vertices in S c y. 
When talking about graphs, we shall slightly abuse notation and sometimes write w{G) 
instead of w{E) and /(G) instead of /(V) when it is clear from the context that w and / 
are functions on the edges and vertices. Finally, our subsets of edges are multisets, i.e., may 
contain the same edge several times. The set operators U, n, \ are defined in the natural 
way. For example, { 61 , 61 , 62 } U { 61 , 62 } = { 61 , 61 , 61 , 62 , 62 }, { 61 , 61 , 62 } n { 61 , 62 } = { 61 , 62 }, and 
{ 61 , 61 , 62 } \ { 61 , 62 } = { 61 }. Other sets, such as subsets of vertices, will always be simple 
sets without any multiplicities. 

2.2 The (Node-Weighted) Asymmetric Traveling Salesman Problem 

It will be convenient to define ATSP using the Eulerian point of view, i.e., we wish to 
find a tour that visits each vertex at least once. As already mentioned in the introduction, 
this definition is equivalent to that of visiting each city exactly once (in the metric 
completion) since we assume the triangle inequality. 

ATSP 

Given: An edge-weighted (strongly connected) digraph G = (V,E,w : E ^ R"*"). 

Find: A cormected Eulerian digraph G' = (V) E') where £' is a multisubset of E that 
minimizes w{E'). 

Similar to the recent progress on STSP, it is natural to consider special cases that are 
easier to argue about but at the same time capture the combinatorial structure of the 
problem. In particular, we shall consider the Node-Weighted ATSP, where we assume 
that there exists a weight function / : 1/ —> R'*' on the vertices so that w{u,v) = f{u). 
(Another equivalent definition, which also applies to undirected graphs, is to let the 
weight of an edge (w, v) be /(w) -l- f{v). This is equivalent to the definition above, if 
scaled down by a factor of 2, since the solutions are Eulerian.) 

Note that this generalizes ATSP on shortest path metrics of unweighted digraphs: 
that is the problem where / is the constant function. As a curiosity, we also note that the 
recent progress on STSP when restricted to shortest path metrics of unweighted graphs 
is not known to generalize to the node-weighted case. We raise this as an interesting 
open problem in Section 6 . 
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2.3 Held-Karp Relaxation 

The Held-Karp relaxation has a variable > 0 for every edge in the given edge- 
weighted graph G = (V, E, w). The intended solution is that Xg should equal the number 
of times e is used in the solution. The relaxation LP(G) is now defined as follows: 

minimize L XeW{e) 

e£E 

subject to x(6'^(u)) = x(6“(u)) v eV, 

x(6+(S)) >1 0 ^ S c y, 

x>Q. 


The first set of constraints says that the in-degree should equal the out-degree for each 
vertex, i.e., the solution should be Eulerian. The second set of constraints enforces that 
the solution is connected and they are sometimes referred to as subtour elimination 
constraints. For Node-Weighted ATSP, we can write the objective function of the linear 
program as Evevfi'^)' where / : 1/ —> K"*" is the weights on the vertices that 

defines the node-weighted metric. 

Finally, we remark that although the Held-Karp relaxation has exponentially many 
constraints, it is well known that we can solve it in polynomial time either by using the 
ellipsoid method with a separation oracle or by formulating an equivalent compact 
(polynomial size) linear program. 

3 ATSP with Local Connectivity 

In this section we define a seemingly easier problem than ATSP by relaxing the 
connectivity requirements. Consider an optimal solution x* to FP(G). Its value, which 
is a lower bound on OPT, can be decomposed into a "lower bound" for each vertex v: 

J^x;w(e) = J^ xlzv(e). 

eeE veV ee6+(z;) 

^ . . 

lower bound for v 

With this intuition, we let lb : 1/ —> R be the lower bound function defined by 
lbx*,G(^) = Lee6+(z;) x^zvie). We simplify notation and write lb instead of Ihx'^c G will 
always be clear from the context and therefore also x* (if the optimal solution to FP(G) 
is not unique then make an arbitrary but consistent choice). Note that lb(y) equals the 
value of the optimal solution to the Held-Karp relaxation. 

Perhaps the main difficulty of ATSP is to satisfy the connectivity requirement, i.e., 
to select a Eulerian subset F of edges that connects the whole graph. We shall now relax 
this condition to obtain what we call Local-Connectivity ATSP: 
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Local-Connectivity ATSP 

Given: An edge-weighted (strongly connected) digraph G = (V, E, w) and a parti¬ 
tioning l/i U 1/2 U ... U Fjt of the vertices that satisfy: the graph induced by 
Vi is strongly connected for i = 

Find: A Eulerian multisubset F of E such that 

> 1 for i = 1,2,... ,k and max - J- is minimized. 

GeC(F) lb(G) 

Recall that C(F) denotes the set of connected components of the graph {V, F). We remark 
that the restriction that each Vj should induce a strongly connected component is not 
necessary but it makes our proofs in Section 4 easier. 

We say that an algorithm for Local-Connectivity ATSP is a-light if it is guaranteed 
(over all instances) to find a solution F such that 


w{G) 

max —^ < a. 
GeC(F) lb(G) 


(3.1) 


We also say that an algorithm is a-light on an ATSP instance G = (V,E,iv) if, for each 
partitioning Pi U ... U Pj;. of P (such that P, induces a strongly cormected graph), it 
returns a solution satisfying (3.1). We remark that we use the a-light terminology to 
avoid any ambiguities with the concept of approximation algorithms because an a-light 
algorithm does not compare its solution with respect to an optimal solution to the given 
instance of Local-Cormectivity ATSP. 

An a-approximation algorithm for ATSP with respect to the Held-Karp relaxation 
is trivially an a-light algorithm for Local-Connectivity ATSP: output the same Eulerian 
subset F as the algorithm for ATSP. Since the set F cormects the graph we have 
maxGg(^(F) ?^(G)/lb(G) = a;(F)/lb(P) < a. Moreover, Local-Cormectivity ATSP seems 
like a significantly easier problem than ATSP as the Eulerian set of edges only needs to 
cross k cuts formed by a partitioning of the vertices. We substantiate this intuition by 
proving, in Section 4, that there exists a simple 3-approximation for Eocal-Connectivity 
ATSP on shortest path metrics of node-weighted graphs. We refer to this case as 
Node-Weighted Local-Cormectivity ATSP. Perhaps more surprisingly, we show in 
Section 5 that any a-light algorithm for Local-Connectivity ATSP can be turned into an 
algorithm for ATSP with an approximation guarantee of 5a with respect to the same 
lower bound (from the Held-Karp relaxation). 

Remark 3.1. Our generic reduction from ATSP to Local-Connectivity ATSP (Theorem 5.1) 
is robust with respect to the definition of lb and there are many possibilities to define 
such a lower bound. Another natural example is lb(a) = Lee6+(D)u6-(D) xlw{e)l2. In fact, 
in order to get a constant bound on the integrality gap of the Held-Karp relaxation, 
our results say that it is enough to find an 0(l)-light algorithm for Local-Cormectivity 
ATSP with respect to some normegative lb that only needs to satisfy that lb(y) is at 
most the value of the optimal solution to the LP. Even more generally, if lb(y) is at most 
the value of an optimal tour then our methods would give a similar approximation 
guarantee (but not with respect to the Held-Karp relaxation). 
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Figure 1: A depiction of the construction of the auxiliary graph G' (in the proof of 
Theorem 4.1): edges are subdivided, an auxiliary vertex A, is added for each partition 
Vi, and Aj is "connected" to subdivisions of the edges in 6^ (Vi) and 6~(Vi). 

4 Approximating Local-Connectivity ATSP 

We give a simple 3-light algorithm for Node-Weighted Local-Connectivity ATSP. 
The proof is based on finding an integral circulation that sends flow across the cuts 
{(Vi, Vi) : i = 1,2,... ,k} and, in addition, satisfies that the outgoing flow of each vertex 
z? € y is at most |’v*(6'''(z;))] + 1 which in turn, by the assumptions on the metric, implies 
a 3-light algorithm. 

Theorem 4.1. There exists a polynomial time 3-light algorithm for Node-Weighted Local- 
Connectivity ATSP. 

Proof. Let G = (V,E,w) and Pi U 1^2 U ... U be an instance of Local-Connectivity 
ATSP where w : E R'*' is a node-weighted metric defined by / : P —> R"*". Let also x* 
be an optimal solution to LP(G). We prove the theorem by giving a polynomial time 
algorithm that finds a Eulerian multisubset F of E satisfying 

\5f(^i)\ > 1 for i = 1,... ,k and |6p(z7)| < rp(6'^(z7))l -l-1 for i? € P. (4.1) 

To see that this is sufficient, note that the Eulerian set F forms a solution to the Local- 
Cormectivity ATSP instance because |6p(P0l ^ 1 for f = 1,...,/c; and it is 3-light since, 
for each G = (P, E) e C(F), we have (using that it is a node-weighted metric) 

u;(G) ^ ^ L,ev(f^wm + m^) ^ ^ 

lb(G) Lz,£v^*('5+(z;))/(z;) 

The last inequality follows from x*((5'''(z’)) > 1 and therefore rp((5'''(z’))] -l-1 < 3P(6'''(z7)). 

We proceed by describing a polynomial time algorithm for finding a Eulerian set F 
satisfying (4.1). We shall do so by finding a circulation in an auxiliary graph G' obtained 
from G as follows (see also Eigure 1): 

1. Replace each edge e = (u,v) in G by adding vertices Oe,Ie and edges 
(u, Og), (Og, Ig), (Ig, v)} 

2. Eor each partition P/, f = 1,..., k, add an auxiliary vertex Aj and edges (A;, Og) for 
every e e 6'^(Vi) and (L, A;) for every e e 6~(Vi). 
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Recall that a circulation in G' is a vector y with a nonnegative value for each edge 
satisfying flow conservation: y((5'''(!7)) = y(b~(p)) for every vertex v. The following 
claim follows from the construction of G together with basic properties of flows and 
circulations. 

Claim 4.2. We can in polynomial time find an integral circulation y in G satisfying: 

y{5'^{Ai)) = 1 for i = 1,... ,k and y((5'^(r’)) < rz*(6'^(z7)l for r; € V. 

Proof. We use the optimal solution x* to LP(G) to define a fractional circulation y' in G' 
that satisfies the above degree bounds. As the vertex-degree bounds are integral, it 
follows from basic facts about flows that we can in polynomial time find an integral 
circulation y satisfying the same bounds (see e.g. Chapter 11 in [18]). Circulation y' is 
defined as follows: 

1. for each edge e = (u, v) in G with u,v E Vf. 

y'iu.O,) = y'(OeM = y'(le,v) = ^{U,V) “ X*(6+(y0) 


2. for each edge e = {u, v) in G with u E Vi, v E V j where i + j: 


y'(OM 

~ ^{u,v)' 



^(u,v) 


y{Ai,o,) 

~ r{b^{Vi))' 

y(u,Oe) ~ ^{U,V) H 


^{u,v) 


y(VAi) 

~ xfb^iVj))' 

1 

II 




x*mvj)) 


Basically, y' is defined so that a fraction llx*{b^{Vi)) of the flow crossing the cut 

y' \ ^i) goes through A,. As x*(6'''(Vi)) > 1 we have that y' is nonnegative. It is also 
immediate from the definition of y' that it satisfies flow conservation and the degree 
bounds of the claim: the in- and out-flow of a vertex v E Vj is (^1 - 
the in- and out-flow of an auxiliary vertex A, is 1 by design; and the in- and out-flow 
of Oe and f for e = (u,v) is (1 - 1 j x* {V i)))xl if u,v E Vi for some i = 1,... ,k and 

X* otherwise. As mentioned above, the existence of fractional circulation y' implies 
that we can also find, in polynomial time, an integral circulation y with the required 
properties. □ 

Having found an integral circulation y as in the above claim, we now obtain the 
Eulerian subset F of edges. Initially, the set F contains y^o^M multiplicities of each edge 
e in G. Note that with respect to this edge set, in each partition W, either all vertices in 
Vi are balanced (each vertex's in-degree equals its out-degree) or there exist exactly 
one vertex u so that |6p(M)| - \bp{u)\ = -1 and one vertex v so that |6p(i7)| - |(5p(z7)| = 1. 
Specifically, let u be the head of the unique edge e such that = 1 arid let v be the 

tail of the unique edge e' so that y^AiP,,) = 1. If w = then all vertices in W are balanced. 
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Otherwise u is so that |6p(M)l “ |6p(w)l = “1 and v is so that |6p(!^)| - |6p(!^)| = 1. In that 
case, we add a simple path from w to f to make the in-degrees and out-degrees of these 
vertices balanced. As the graph induced by Vi is strongly connected, we can select the 
path so that it only visits vertices in Vi. Therefore, we only increase the degree of vertices 
in Vi by at most 1. Hence, after repeating this operation for each partition Vi, we have 
that F is a Eulerian subset of edges and |6f (6'''(u))| < y{6'''{v)) -l-1 < -l-1 for all 

V eV. Finally, we have 16^ (V';)| > 1 for each i = because yA„Oc = 1 (and therefore 

yOe,ic ^ I) <3rie edge e e 6'^(Vi). We have thus given a polynomial time algorithm that 
finds a Eulerian subset F satisfying the properties of (4.1), which, as discussed above, 
implies that it is a 3-light algorithm for Node-Weighted Eocal-Connectivity ATSP. □ 

5 From Local to Global Connectivity 

In this section, we prove that if there is an a-light algorithm for Local-Connectivity 
ATSP, then there exists an algorithm for ATSP with an approximation guarantee of 
0{a). The main theorem can be stated as follows. 

Theorem 5.1. Let .91 be an algorithm for Local-Connectivity ATSP and consider an ATSP 
instance G = (V, E, w). IfLR is a-light on G, there exists a tour ofG with value at most 5a lb(y). 
Moreover, for any e > 0, a tour of value at most (9 -l- e)a lb(y) can be found in time polynomial 
in the number n = |y| of vertices, in 1/e, and in the running time of .91. 

Throughout this section, we let G = (V, E, w) and .91 be fixed as in the statement 
of the theorem. The proof of the theorem is by giving an algorithm that uses as a 
subroutine. We first give the non-polynomial algorithm in Section 5.1 (with the better 
guarantee) followed by Section 5.2 where we modify the arguments so that we also 
efficiently find a tour (with slightly worse guarantee). 

5.1 Existence of a Good Tour 

Before describing the (non-polynomial) algorithm, we need to introduce the concept 
of Eulerian partition. We say that graphs Hi = (Vi,E-i),H 2 = (1^2/£ 2 )/ ■■■ ,iik = {^k,Lk) 
form a Eulerian partition of G if the vertex sets Vi, ..., form a partition of V and each 
Eli is a connected Eulerian graph where E; is a multisubset of E. It is an jl-light Eulerian 
partition if in addition 


w{Eli) < fl ■ \h{Hi) for f = 1,..., k. 

Our goal is to find a 5a-light Eulerian partition that only consists of a single component, 
i.e., a 5a-approximate solution to the ATSP instance G with respect to the Held-Karp 
relaxation. 

The idea of the algorithm is to start with a Eulerian partition and then iteratively 
merge/connect these connected components into a single connected component by 
adding (cheap) Eulerian subsets of edges. Note that, since we will only add Eulerian 
subsets, the algorithm always maintains that the connected components are Eulerian. 

The state of the algorithm is described by a Eulerian multiset E* that contains the 
multiplicities of the edges that the algorithm has picked. 
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Initialization. The algorithm starts with a 2a-light Eulerian partition H* = 
(Vp , H* = (y*, that maximizes the lexicographic order of 

<lb(Hp,lb(H2),...,lb(Hp). (5.1) 

As the lexicographic order is maximized, the Eulerian partitions are ordered so that 
lb(Hp > lb(Hp > • • • > lb(Hp. For simplicity, we assume that these inequalities are strict 
(which is w.l.o.g. by breaking ties arbitrarily but consistently). The set £* is initialized 
so that it contains the edges of the Eulerian partitions, i.e., £* = £* U £* U • • • U £*. 

During the execution of the algorithm we will also use the following concept. For 
a connected subgraph G = (V, E) of G, let low(G) denote the Eulerian partition H* of 
lowest index i that intersects G^. That is. 


low(G) = H* . 

Note that after initialization, the connected components in C(£*) are exactly the 
subgraphs H*,...,H*. This means that H*. = low(G) for exactly one component 
G e C(£*). Moreover, as the algorithm will only add edges, each H*. will be in at most 
one component throughout the execution. 

Remark 5.2. The main difference in the polynomial time algorithm is the initialization 
since we do not know how to find a 2a-light Eulerian partition that maximizes the 
lexicographic order in polynomial time. Indeed, it is consistent with our knowledge 
that 2a (even 2) is an upper bound on the integrality gap and, in that case, such an 
algorithm would always find a tour. 

Remark 5.3. For intuition, let us mention that the reason for starting with a Eulerian 
partition that maximizes the lexicographic order is that we will use the following 
properties to bound the weight of the total tour: 

1. A connected Eulerian subgraph H of G with zv{H) < 2alb(H) has lb(H) < 
lb(low(H)). 

2. For any disjoint cormected Eulerian subgraphs Hi, H 2 ,..., H^ of G with low(Hy) = 
H* and w{Hj) < a Ib(Hy) for ; = 1,..., i, we have 

i 

J]^lb(Hy)<21b(Hp. 

1=1 

These bounds will be used to bound the weight of the edges added in the merge 
procedure. Their proofs are easy and can be found in the analysis (see the proofs of 
Claim 5.8 and Claim 5.9). 

^Equivalently, it is the set YT. maximizing lb(H*) over all sets in the Eulerian partition that intersect G. 
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Merge procedure. The algorithm repeats the following "merge procedure" until C(£*) 
contains a single connected component. The components in C{E*) partition the vertex 
set and each component is strongly connected as it is Eulerian (since E* is a Eulerian 
subset of edges). The algorithm can therefore use ^ to find a Eulerian multisubset F of 
£ such that 

(i) |6+(1/)| > 1 for all (V, E) e C(E*); and 

(ii) for each G e C(F) we have w(G) < a lb(G). 

Note that M is guaranteed to find such a set since it is assumed to be an a- light 
algorithm for Local-Connectivity ATSP on G. Furthermore, we may assume that no 
connected component in C(F) is completely contained in a connected component in 
C(E*) (except for the trivial components formed by singletons). Indeed, the edges of 
such a component can safely be removed from F and we have a new (smaller) multiset 
that satisfies the above conditions. Having selected F, we now proceed to explain the 
"update phase": 

Ul: LetX = 0. 

U2: Select the component G = (V,E) e C(E* U F U X) that minimizes lb(low(G)). 

U3: If there exists a cycle C = {Vq, Eq) in G of weight w{C) < a lb(low(G)) that cormects 
G to another component in C(£* U F U X), then add £c to X and repeat from 
Step U2. 

U4: Otherwise, update E* by adding the "new" edges in £, i.e., £*<—£* U (EnF) U (EnX). 

Some comments about the update of £* are in order. We emphasize that we do 
not add all edges of F U X to £*. Instead, we only add those new edges that belong 
to the component G selected in the final iteration of the update phase. As G is a 
connected component in C(£* U F U X), F and X are Eulerian subsets of edges, we have 
that £* remains Eulerian after the update. This finishes the description of the merging 
procedure and the algorithm (see also the example below). 

Example 5.4. In Figure 2, we have that, at the start of a merging step, C(E*) consists of 
6 components containing {Hg}, {H^Hg}, and {H^}. The blue 

(solid) cycles depict the connected Eulerian components of the edge set F. First, we 
set X = 0 and the algorithm selects the component G in C(£* U F U X) that minimizes 
lb(low(G)) or, equivalently, that maximizes min{i: FF* intersects G}. In this example, it 
would be the left most of the three components in C(£* U F) with low(G) = FF*. The 
algorithm now tries to connect this component to another component by adding a 
cycle with weight at most a lb(FFp. The red (dashed) cycle corresponds to such a cycle 
and its edge set is added to X. In the next iteration, the algorithm considers the two 
components in C(£* U F U X). The smallest component (with respect to lb(low(G))) is 
the one that contains FF*, FF*, and FF*. Now suppose that there is no cycle of weight at 
most a lb(Ff*) that connects this component to another component. Then the set E* is 
updated by adding those edges of F U X that belong to this component (depicted by 
the thick cycle). 
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Figure 2: An illustration of the merge procedure. Blue (solid) cycles depict F and the 
red (dashed) cycle depicts X after one iteration of the update phase. The thick cycle 
represents the edges that this merge procedure would add to E*. 

5 . 1.1 Analysis 

Termination. We show that the algorithm terminates by arguing that the update phase 
terminates with fewer cormected components and the merge procedure is therefore 
repeated at most /c < n times. 

Lemma 5.5. The update phase terminates in polynomial time and decreases the number of 
connected components in C{E*). 

Proof. First, observe that each single step of the update phase can be implemented 
in polynomial time. The only nontrivial part is Step U3 which can be implemented 
as follows: for each edge {u,v) e 6'''(V) consider the cycle consisting of (u,v) and a 
shortest path from v to u. Moreover, the whole update phase terminates in polynomial 
time because each time the if-condition of Step U3 is satisfied, we add a cycle to X that 
decreases the number of cormected components in C(£* U F U X). The if-condition of 
Step U3 can therefore be satisfied at most k ^n times. 

We proceed by proving that at termination the update phase decreases the number 
of cormected components in C(£*). Consider when the algorithm reaches Step U4. In 
that case it has selected a component G = (V, E) e C(E* U F U X). Note that G ^ C(E*) 
because the edge set F crosses each cut defined by the vertex sets of the cormected 
components in C(F*). Therefore when the algorithm updates E* by adding all the edges 
(F U X) n F it decreases the number of components in C(E*) by at least one. □ 

Performance Guarantee. To analyze the performance guarantee we shall split our 
analysis into two parts. Note that when one execution of the merge procedure 
terminates (Step U4) we add edge set (F n F) U (X n F) to our solution. We shall 
analyze the contribution of these two sets F D E and X n £ separately. More formally, 
suppose that the algorithm does T repetitions of the merge procedure. Let Gi = 
(^^ 1 /Fi)/G 2 = (V2,E2), ... ,Gt = (Vt,Et), Fi,F 2 , ... ,Ft, and Xi,X 2 , ...,Xt denote the 
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selected components, the edge set F, and the edge set X, respectively, at the end of each 
repetition. To simplify notation, we denote the edges added to E* in the f:th repetition 
by Ft = Ft n Ft and Xt = Xt D Ft. 

With this notation, we proceed to bound the total weight of the solution by 

k 

w + w < 5a lb(y) as claimed in Theorem 5.1. 

<2a lb(y) by Lemma 5.7 lb(V) by Lemma 5.6 

Here we used that Y!i=\ ^ 2a lb(y) since H*,..., H* is a 2a-light Eulerian partition. 

It remains to prove Lemmas 5.6 and 5.7. 

Lemma 5.6. We have w < a lb(y). 

Proof. Note that Xt consists of a subset of the cycles added to Xf in Step U3 of the update 
phase. Specifically, those cycles contained in the connected component Gt selected 
at Step U2 in the last iteration of the update phase during the f:th repetition of the 
merge procedure. We can therefore decompose U^^Xf into cycles Ci = (V'i,£i),C 2 = 
{V 2 , £ 2 )/ ■ ■ • / Cc = (Vc, Ec) indexed in the order they were added by the algorithm. When 
Cj was selected in Step U3 of the update phase, it satisfied the following two properties: 

(i) it cormected the component G selected in Step U2 with at least one other component 
G' such that lb(low(G')) > lb(low(G)); and 

(ii) it had weight w{Cj) at most a lb(low(G)). 

In this case, we say that Cj marks low(G). 

We claim that at most one cycle in Ci, C 2 , ■■■,Cc marks each H*, H*,..., To see 
this, consider the first cycle Cj that marks H* (if any). By (i) above, when Cj was 
added, it connected two components G and G' such that lb(low(G')) > lb(low(G)) 
where low(G) = H*. As the algorithm only adds edges, G and G' will remain connected 
throughout the execution of the algorithm. Therefore, by the definition of low and by 
the fact that lb(low(G')) > lb(low(G)), we have that a component G" appearing later in 
the algorithm always has low(G") FI*. Hence, no other cycle marks FI*. 

The boimd now follows from that at most one cycle marks each FI* and such a cycle 
has weight at most a lb (H*). □ 

We complete the analysis of the performance guarantee with the following lemma. 

Lemma 5.7. We have w < 2a lb(y). 

Proof. Consider the f:th repetition of the merge procedure. The edge set Ft is Eulerian 
but not necessarily connected. Let "F* denote the set of the Eulerian subgraphs 
corresponding to the connected components in C{Ft) where we disregard the trivial 
components that only consist of a single vertex. Lurther, partition F* into F.^, F^, ■ ■ ■ >^1 
where F^ contains those Eulerian subgraphs in F^ that intersect FP. and do not intersect 
any of the subgraphs H*, H*^,..., H*_^. That is, 

Fl = {HeF^: low(H) = H*}. 
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Note that the total weight of Ft, w{Pt), equals w{T'^) = Y!i=\ We bound the weight 

of by considering each ‘Fj separately. We start by two simple claims that follow 
from that each H satisfies w{H) ^ a lb(H) (since is an n-light algorithm) and 
the choice of H*,..., H* to maximize the lexicographic order of (5.1). We remark that 
the proofs of the following claims are the only arguments that use the fact that the 
lexicographic order was maximized. 

Claim 5.8. For H e T^, we have lb(H) < lb(low(H)) = lb(Hp. 

Proof. Inequality lb(H) > lb(H*) together with the fact that zv{H) < alb(H) < 2ci:lb(H) 
would contradict that H *,... was chosen to maximize the lexicographic order of (5.1). 
Indeed, in that case, a 2a-light Eulerian partition of higher lexicographic order would 
be HpH *,... and the remaining vertices (as trivial singleton components) that 

do not belong to any of these Eulerian subgraphs. □ 

Claim 5.9. We have lb(!F)0 < 21b(H*). 

Proof. Suppose toward contradiction that lb(!F).^) > 21b(H*). Let .. .,H{} 

and define H* to be the Eulerian graph obtained by taking the union of the graphs H* and 
Hi,... ,H{. Consider the Eulerian partition H*,..., H*_^, H* and the remaining vertices 
(as trivial singleton components) that do not belong to any of these Eulerian subgraphs. 
We have lb(H*) > lb(H*) and therefore the lexicographic value of this Eulerian partition 
is larger than the lexicographic value of H*,... ,H*. This is a contradiction if it is also a 

2a-light Eulerian partition, i.e., if < 2a. 

Therefore, we must have w{H*) > 2a lb(H*). By the facts that w{Hj) < a Ib(Hy) (since 
FI is an a-light algorithm) and that H*,... ,H* is a 2a-light Eulerian partition, 

t t t 

w(H*) = w{H*) + Yj MHj) < 2a lb(H;) + Y^ ^ Yj 

j=i j=i j=i 

These inequalities together with w(H*) > 2alb(H*) imply lb(!F.^) = Ly=ilb(Hy) < 
21b(H;). ' □ 

Using the above claim, we can write zv as 

T k T k k k 

=“Tj Tj L ■’’W- 

f=l i=l t=l i=l 1=1 '=1 

We complete the proof of the lemma by using Claim 5.8 to prove that is non-empty 
for at most one repetition t of the merge procedure. Suppose toward contradiction that 
there exist 1 < fo < U ^ E so that both FI° + 0 and FI"" 0. In the fodh repetition of 
the merge procedure, H* is contained in the subgraph Gtg since otherwise no edges 
incident to H* would have been added to E*. Therefore lb(low(GtQ)) > lb(H*). Now 
consider a Eulerian subgraph H e F^f. First, we cannot have that H is contained in the 
component Gtg since each (nontrivial) component of F is assumed to not be contained 
in any component of C{E*). Second, by Claim 5.8, we have w{H) < a lb(H) < a lb(H*). 
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In short, H is a Eulerian subgraph that connects Gtg to another component and it 
has weight at most a lb(low(Gfp)). As H is Eulerian, it can be decomposed into cycles. 
One of these cycles, say C, cormects Gtg to another component and 

zv{G) < zviH) ^ a lb(H;) ^ a lb(low(GtJ). (5.2) 

In other words, there exists a cycle G that, in the fodh repetition of the merge procedure, 
satisfied the if-condition of Step U3, which contradicts the fact that Step U4 was reached 
when component Qg was selected. □ 

5.2 Polynomial Time Algorithm 

In this section we describe how to modify the arguments in Section 5.1 to obtain an 
algorithm that runs in time polynomial in the number n of vertices, in 1 je, and in the 
rurming time of 

By Lemma 5.5, the update phase can be implemented in polynomial time in n. 
Therefore, the merge procedure described in Section 5.1 runs in time polynomial in 
n and in the running time of The problem is the initialization: as mentioned in 
Remark 5.2, it seems difficult to find a polynomial time algorithm for finding a 2a-light 
Eulerian partition H*,... ,H* that maximizes the lexicographic order of 

<lb(Hp,lb(H;),...,lb(Hp). 

We overcome this obstacle by first identifying the properties that we actually use 
from selecting the Eulerian partition as above. We then show that we can obtain a 
Eulerian partition that satisfies these properties in polynomial time. 

As mentioned in the analysis in Section 5.1, the only place where we use that 
the Eulerian partition maximizes the lexicographic order of (5.1) is in the proof of 
Lemma 5.7. Specifically, it is used in the proofs of Claims 5.8 and 5.9. Instead of 
proving these claims, we shall simply concentrate on finding a Eulerian partition that 
satisfies a relaxed variant of them (formalized in the lemma below, see Condition (5.3)). 
The claimed polynomial time algorithm is then obtained by first proving that a slight 
modification of the merge procedure returns a tour of value at most {9a + 2c) lb(y) 
if Condition (5.3) holds, and then we show that a Eulerian partition satisfying this 
condition can be found in time polynomial in n and in the running time of We start 
by describing the modification to the merge procedure. 

Modified merge procedure. The only modification to the merge procedure in Sec¬ 
tion 5.1 is that we change the update phase by relaxing the condition of the if-statement 
in Step U3 from w{G) < alb(low(G)) to 'w{G) < a(3 lb(low(G)) -l- £lb(F)/n). In other 
words. Step U3 is replaced by 

U3': If there exists a cycle C = (yc/^c) inGof weighta;(C) < a(31b(low(G))-l-£lb(y)/n) 
that connects G to another component in C{E* U F U X), then add Eq to X and 
repeat from Step U2. 
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Clearly the modified merge procedure still runs in time polynomial in n and in 
the rurming time of J?l. Moreover, we show that if Condition (5.3) holds then the 
returned tour will have weight 0{a). Recall from Section 5.1 that ft denotes the 
subset of F and Xf denotes the subset of X that were added in the f:th repetition of 
the (modified) merge procedure. Furthermore, we define (as in the previous section) 
= {H e C(Ft) : low(H) = H* and H is a nontrivial component, i.e., H contains more 
than one vertex}. 

Lemma 5.10. Assume that the algorithm is initialized with a 3a-light Eulerian partition 
HpH*,... ,H* so that, in each repetition t of the modified merge procedure, we add a subset Ft 
such that 

Ihirl) < 3 lb(H;) + for i = 1,2,..., k. (5.3) 

n 

Then the returned tour has weight at most (9 + 2e)a lb(y). 

Let us comment on the above statement before giving its proof. The reason that we 
use a 3a-light Eulerian partition (instead of one that is 2a-light) is that it leads to a better 
constant when balancing the parameters. We also remark that (5.3) is a relaxation of the 
bound of Claim 5.9 from \hfF-) < 2 lb(Hp to Ib(Fl^) < 3 lb(Hp + e lb(l/)/n; and it also 
implies a relaxed version of Claim 5.8: fromlb(H) < lb(Hptolb(H) < 3\h{H*)+e\h(V)/n. 
It is because of this relaxed bound that we modified the if-condition of the update 
phase (by relaxing it by the same amount) which will be apparent in the proof. 

Proof. As in the analysis of the performance guarantee in Section 5.1, we can write the 
weight of the returned tour as 

k 

w (u[^^Ft) + w ^ w{H*f). 

i=l 

To bound we observe that proof of Lemma 5.6 generalizes verbatim 

except that the weight of a cycle C that marks H*. is now bounded by a{3 \h{H* )+£ lb(y) jn) 
instead of by a lb(Hp (because of the relaxation of the bound in the if-condition of the 

update phase). Hence, zv(ul^Xt) < L^=i ct(31b(Hp-l-£lb(y)/n) < (3-l-£)a lb(y) because 
/c < n. 

We proceed to bound Using the same arguments as in the proof of 

Lemma 5.7, 

k k 

MTl) 

!=1 '=1 

where, for the last inequality, we used the assumption of the lemma. Now we apply 
exactly the same arguments as in the end of the proof of Lemma 5.7 to prove that T'l is 
non-empty for at most one repetition t of the merge procedure. The only difference, is 
that (5.2) should be replaced by 

w{C) < w{H) ^ a{3\h{H*) -l- £lb(y)/n) < a(31b(low(Gtp)) -l- £lb(y)/n) 
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(because (5.3) can be seen as a relaxed version of Claim 5.8). However, as we also 
updated the bound in the if-condition, the argument that C would satisfy the if-condition 
of Step U3' is still valid. Hence, we conclude that is non-empty in at most one 
repetition and therefore 

k 

i: (31b(H;) + e\h{V)/n) < (3 + e)a\h{V). 
i=l 

By the above bounds and since H*,!-!*,... ,H* is a 3a-light Eulerian partition, we 
have that the weight of the returned tour is 

k 

w + w MH*) < (3 + e)a lb(y) + (3 + e)a lb(y) + 3a lb(y) 

;=1 

= (9 + 2£)alb(y). 


□ 

Finding a good Eulerian partition in polynomial time. By the above lemma, it is 
sufficient to find a 3a-light Eulerian partition so that Condition (5.3) holds during the 
execution of the modified merge procedure. However, how can we do it in polynomial 
time? We do as follows. First, we select the trivial 3a-light Eulerian partition where 
each subgraph is only a single vertex. Then we run the modified merge procedure and, 
in each repetition, we verify that Condition (5.3) holds. Note that this condition is easy 
to verify in time polynomial in n. If it holds until we return a tour, then we know by 
Lemma 5.10 that the tour has weight at most (9 + 2e)a lb(y). If it does not hold during 
one repetition, then we will restart the algorithm with a new 3a-light Eulerian partition 
that we find using the following lemma. We continue in this manner until the merge 
procedure executes without violating Condition (5.3) and therefore it returns a tour of 
weight at most (9a -l- le) lb(y). 

Lemma 5.11. Suppose that repetition t of the (modified) merge procedure violates Condition (5.3) 
when run starting from a 3a-light Eulerian partition ... ,H*. Then we can, in time 

polynomial in n,find a new 3a-light Eulerian partition ... ,H! so that 

k k 2 

Y, lb(Hp' - Y ^ ^ (5-4) 

j=l j=l 

Note that the above lemma implies that we will reinitialize (in polynomial time) the 
Eulerian partition at most 3n^ times because any Eulerian partition has 

Ltiib(H;)2< lb(y)^. As each execution of the merge procedure takes time polynomial 
in n and in the rurming time of J?l, we can therefore find a tour of weight at most 
(9 + 2e)a lb(y) = (9 -l- £')a lb(y) in the time claimed in Theorem 5.1, i.e., polynomial in 
n, 1 !e', and in the running time of It remains to prove the lemma. 
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Proof. Since the t:th repetition of the merge procedure violates Condition (5.3), there is 
an 1 < f ^ /c such that 

\h{rl) > + -\h{V). 

n 

We shall use this fact to construct a new 3a-light Eulerian partition consisting of 
a new Eulerian subgraph H* together with a subset of ... ,H*} containing 

those subgraphs that do not intersect H* and finally the vertices (as trivial singleton 
components) that do not belong to any of these Eulerian subgraphs. We need to define 
the Eulerian subgraph H*. Let I c {1,2,..., A:} be the indices of those Eulerian subgraphs 
of H*,... ,H* that intersect the vertices in 'p'f Note that, by definition, we have i € I 
and j > i for all j € J. We shall construct the graph H* iteratively. Initially, we let H* be 
the connected Eulerian subgraph obtained by taking the union of and H*. This is a 
connected Eulerian subgraph as each Eulerian subgraph in 'Fl intersects H* and H* is a 
connected Eulerian subgraph. 

The careful reader can observe that up to now H* is defined in the same way as 
in the proof of Claim 5.9. However, in order to satisfy (5.4) we shall add more of 
the Eulerian subgraphs in {H*}jei to H*. Specifically, we would like to add 
where I' c I \ {/} is selected so as to maximize lb(H*) (because we wish to increase the 
"potential" in (5.4)) subject to that w{H*) < 3a lb(H*) (because the new Eulerian partition 
should be 3a-light). 

To see that w{H*) < 3a lb(H*) implies that the new Eulerian partition is 3a-light, 
recall that the new Eulerian partition consists of H*, the Eulerian subgraphs and 

the vertices that do not belong to any of these Eulerian subgraphs. By the definition of I, 
no H* with j i I intersects H*. As H *,... are disjoint, it follows that the new Eulerian 
partition consists of disjoint subgraphs. Moreover, each H* satisfies w(Hp < 3a lb(Hp 
since the Eulerian partition we started with is 3a-light. Hence, the new Eulerian 
partition is 3a-light if w{H*) < 3a lb(H*). Inequality (5.5) is thus a sufficient condition 
for the new Eulerian partition to be 3a-light. We remark that the condition trivially 
holds for J' = 0 because Vo{Fl) > 3 lb(Hp + e lb(y)/n. 

Claim 5.12. We have w{H*) < 3a lb(H*) if 

lb(H* n 5-p < ^ lb(!F:p - lb(H; n 5-p. (5.5) 

jer 


Proof. We have 

w(H*) = zv{rl) + w(H;) + Y + 3a lh(H*) + 3aY 

jer jer 

where the inequality follows from that was selected by the a-light algorithm F and 
H*,... ,H* is a 3a-light Eulerian partition. Moreover, 

lb(H*) = IhiFi) + mh; \FI) + Y lb(H; \ FI). 

jer 
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Hence, we have, by rearranging terms and using lb(H*) - lb(H’t \ = lb(H* n that 

w{H*) < 3a lb(H*) holds if 

3a lb(H; n rl) + \h{H* n rl) < 2a lh{rl). 

jer 


The above can be simplified to 

ib(H* nrl)^2 \h{rl)/3 - ib(H; n rl). 

jer 


□ 

From the above discussion, we wish to find a subset J' c f \ {f} that satisfies (5.5) 
and maximizes 


lb(H‘) = lb(5^0 + lb(H) \ 9-/) + Y mh; \ rl), 

jer 

where only the last term depends on the selection of I'. We interpret this as a knapsack 
problem that, for each j e I \ {i}, has an item of size sj = lb(H* n 'F*) and profit 

Pj = \h{H* \ Fj)', the capacity U of the knapsack is | ^{Fj) - lb(H* n Fj), i.e., the 
right-hand-side of (5.5). We solve this knapsack problem and obtain F as follows: 

1. Find an optimal extreme point solution z* to the standard linear programming 
relaxation of the knapsack problem: 

maximize E ^iVj 
;eA|!i 

subject to Y_^ ^j^j ^ 
i'eAli) 

0 < Zy < 1 for all / e I \ {i}. 

2. As the above relaxation has only one constraint (apart from the boundary 
constraints), the extreme point z* has at most one variable with a fractional value. 
We obtain an integral solution (i.e., a packing) by simply dropping the fractionally 
packed item. That is, we let F = [j e I \ {/} : z* = 1}. 

The running time of the above procedure is dominated by the time it takes to solve the 
linear program. This can be done very efficiently by solving the fractional knapsack 
problem with the greedy algorithm (or, for the purpose here, use any general polynomial 
time algorithm for linear programming). We can therefore obtain F and the new Eulerian 
partition in time polynomial in |f| < n as stated in lemma. 

It remains to prove (5.4). Let us first bound the profit of our "knapsack solution" F. 

Claim 5.13. We have Ljer lb(H* \ F^) > i Ljei\i MH] \ F*) - lb(H;). 
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Proof. By definition. 


jel' 


L 


Pi 






L 


Z-pj - max pj, 


where we used that at most one item is fractionally packed in z*. As j > i for all j € 1, 
maxjg 7 \j,) Pj = maxygj\j;) lb(H* \ T'-) < lb(Hp. To complete the proof of the claim, it is 
thus sufficient to prove that z' = 1/3 for all j e J \ {i} is a feasible solution to the LP 
relaxation of the knapsack problem. Indeed, by the optimality of z*, we then have 
I/eAfl ^)Pi > T.jem ^'jPi = 5 T.jel\\i] MH* \ Tl). 

We have that z' is a feasible solution because 


I L ib(H-nA)<;hb(A)« I 


ib(H;n:T:0' 
Mrl) ^ 


Mrl) = u, 


where the first inequality follows from that the subgraphs are disjoint and the 

second inequality follows from that lb(Hp/ lb(!Fj.^) <1/3. □ 

We finish the proof of the lemma by using the above claim to show the increase 
of the "potential" function as stated in (5.4). By the definition of the new Eulerian 
partition (it contains we have that the increase is at least 


Let us concentrate on the first term: 

( 


lb(H*)2 = 


i^i' 

( 




i^i' 


By Claim 5.13, we have that the expression inside the parenthesis is at least 

MTl) + ib(H; \ rl)+l Yj \ 

>Mr!) + -MHd- 


By using lb(Hp < lb(!Tj^)/3, we can further lower bound this expression by 

( 

bi'Fh + V Ihi/TA'Fh = . 

3 ' 3 




' 

\h{ri)+Y \ =3 +\Y 
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Finally, as lb(!FlO ^ ^ lb(^)/^/ lb(!FlO > 3 lb(Hp, and lb(Hp < lb(Hp for all j e I, we have 
lb(H*)2 - IhiHf > lb(H*)2 - lb(H;) Y MH]) 






> urh 


^ib(:^V^X,ib(Hp -\h{H;)YMH]) 

jel jel 


HTh^ MTh „ „ 

> Y L ^^(^P 








which completes the proof of Lemma 5.11. 


□ 


6 Discussion and Open Problems 

We gave a new approach for approximating the asymmetric traveling salesman problem. 
It is based on relaxing the global cormectivity requirements into local cormectivity 
conditions, which is formalized as Local-Connectivity ATSP. We showed a rather easy 3- 
light algorithm for Local-Connectivity ATSP on shortest path metrics of node-weighted 
graphs. This yields via our generic reduction a constant factor approximation algorithm 
for Node-Weighted ATSP. However, we do not know any 0(l)-light algorithm for 
Local-Connectivity ATSP on general metrics and, motivated by our generic reduction, 
we raise the following intriguing question: 

Open Question 6.1. Is there a 0(l)-light algorithm for Local-Cormectivity ATSP on 
general metrics? 

We note that there is great flexibility in the exact choice of the lower bound lb as 
noted in Remark 3.1. A further generalization of our approach is to interpret it as a 
primal-dual approach. Specifically, it might be useful to interpret the lower bound as a 
feasible solution of the dual of the Held-Karp relaxation: the lower bound is then not 
only defined over the vertices but over all cuts in the graph. We do not know if any of 
these generalizations are useful at this point and it may be that there is a nice 0(l)-light 
algorithm for Local-Cormectivity ATSP without changing the definition of lb. 

By specializing the generic reduction to Node-Weighted ATSP, it is possible to 
improve our bounds slightly for this case. Specifically, one can exploit the fact that a 
cycle C always has w{C) < lb(C) in these metrics. This allows one to change the bound 
in Step U3 of the update phase to be w{C) < lb(low(G)) instead of w{C) ^ a lb(low(G)), 
which in turn improves the upper bound on the integrality gap of the Held-Karp 
relaxation to 4 • a -I- 1 = 13 (since a = 3 for node-weighted metrics). That said, we do 
not see how to make a significant improvement in the guarantee and it would be very 
interesting with a tight analysis of the integrality gap of the Held-Karp relaxation for 
Node-Weighted ATSP. We believe that such a result would also be very interesting even 
if we restrict ourselves to shortest path metrics of unweighted graphs. 
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Finally, let us remark that the recent progress for STSP on shortest path metrics 
of unweighted graphs is not known to extend to node-weighted graphs, i.e., Node- 
Weighted STSP. Is it possible to give a (1.5 - £)-approximation algorithm for Node- 
Weighted STSP for some constant £ > 0? We think that this is a very natural question 
that lies in between the now fairly well understood STSP on shortest path metrics 
of unweighted graphs and STSP on general metrics (i.e., edge-weighted instead of 
node-weighted graphs). 
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